Detection of Local Disturbances and Simultaneously Active Speakers for Distributed Speaker-Dedicated Microphones in Cars
Authors
Abstract
For automotive hands-free and speech recognition applications, distributed microphones are often mounted in the car so that each speaker has a dedicated microphone close to their position. To provide additional control information for further speech enhancement, it is often advantageous to distinguish between the activity of the different passengers. In this contribution, speaker activity is identified by evaluating the ratios between the powers of multiple speaker-dedicated microphone signals, while other acoustic events are differentiated from a single speaker's activity. An effective algorithm that exploits the expected range of power ratio values is presented; it is able to detect local disturbances such as scratch noise at the microphones. Furthermore, situations are identified in which multiple passengers speak at the same time. In addition to some examples, it is shown that the proposed method for local disturbance detection is able to identify such noise under realistic driving conditions.
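The idea of checking a power ratio against an expected range can be illustrated with a minimal sketch. This is not the authors' algorithm: the function names, the noise floor, and both dB thresholds are hypothetical placeholders chosen only for illustration. The assumption is that genuine speech from one seat also reaches the other microphones with bounded attenuation, so an implausibly large ratio between the two strongest microphones suggests a disturbance local to one microphone, while a small ratio is left as ambiguous (it could be simultaneous speakers or diffuse noise, which this sketch does not separate).

```python
import math


def frame_power(samples):
    """Mean squared amplitude of one frame of samples."""
    return sum(s * s for s in samples) / len(samples)


def classify_frame(powers, noise_floor=1e-6, min_ratio_db=6.0, max_ratio_db=25.0):
    """Classify one frame from per-microphone powers (needs at least two mics).

    All default thresholds are hypothetical, not values from the paper.
    Returns a (label, microphone_index_or_None) tuple.
    """
    eps = 1e-12  # avoids division by zero for silent channels
    # Microphone indices sorted from strongest to weakest.
    order = sorted(range(len(powers)), key=lambda i: powers[i], reverse=True)
    top, second = order[0], order[1]

    if powers[top] < noise_floor:
        return ("silence", None)

    ratio_db = 10.0 * math.log10((powers[top] + eps) / (powers[second] + eps))

    if ratio_db > max_ratio_db:
        # Ratio above the assumed plausible range for speech in the cabin.
        return ("local_disturbance", top)
    if ratio_db >= min_ratio_db:
        # One microphone clearly dominates within the assumed range.
        return ("single_speaker", top)
    # No dominant microphone: several talkers or diffuse noise.
    return ("ambiguous_or_multiple", None)
```

For example, with two microphones, `classify_frame([1e-2, 1e-3])` falls in the assumed speech range (about 10 dB), whereas `classify_frame([1e-2, 1e-6])` (about 40 dB) is flagged as a local disturbance at microphone 0. A practical system would also need smoothing over time and thresholds fitted to the actual cabin.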
Similar resources
Enhancement of speech in multispeaker environment
In this paper a method based on the excitation source information is proposed for enhancement of speech, degraded by speech from other speakers. Speech from multiple speakers is simultaneously collected over two spatially distributed microphones. Time-delay of each speaker with respect to the two microphones is estimated using the excitation source information. A weight function is derived for ...
Speakers Determination and Isolation from Multispeaker Speech Signal
In this letter, we address the issue of determining the number of speakers from multispeaker speech signals collected simultaneously using a pair of spatially separated microphones. The spatial separation of the microphones results in time delay of arrival of speech signals from a given speaker. The differences in the time delays for different speakers are exploited to determine the number of s...
Speech processing using digital MEMS microphones
The last few years have seen the start of a unique change in microphones for consumer devices such as smartphones or tablets. Almost all analogue capacitive microphones are being replaced by digital silicon microphones or MEMS microphones. MEMS microphones perform differently to conventional analogue microphones. Their greatest disadvantage is significantly increased self-noise or decreased SNR...
Multi-modal recording, analysis and indexing of poster sessions
A new project on multi-modal analysis of poster sessions is introduced. We have designed an environment dedicated to recording of poster conversations using multiple sensors, and collected a number of sessions, to which a variety of multi-modal information is annotated, including utterance units for individual speakers, backchannels, nodding, gazing, and pointing. Automatic speaker diarization,...
Online blind speech separation using multiple acoustic speaker tracking and time-frequency masking
Separating speech signals of multiple simultaneous talkers in a reverberant enclosure is known as the cocktail party problem. In real-time applications online solutions capable of separating the signals as they are observed are required in contrast to separating the signals offline after observation. Often a talker may move, which should also be considered by the separation system. This work pr...